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Abstract. Let (Xjk)jk^i be i.i.d. nonnegative random variables with bounded density, 
mean m, and finite positive variance a 2 . Let M be the n x n random Markov matrix 
with i.i.d. rows defined by Mjk = Xjk/{Xj\ + ■ ■ ■ +Xj n ). In particular, when Xu follows 
an exponential law, the random matrix M belongs to the Dirichlet Markov Ensemble 
of random stochastic matrices. Let Ai, . . . , A„ be the eigenvalues of y/nM i.e. the roots 
in C of its characteristic polynomial. Our main result states that with probability one, 
the counting probability measure — 5\ 1 + ■ • ■ + — Sx n converges weakly as n — > oo to the 
uniform law on the disk {z £ C : \z\ ^ m~ 1 (j}. The bounded density assumption is 
purely technical and comes from the way we control the operator norm of the resolvent. 
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The eigenvalues of an n x n complex matrix A are the roots in C of its characteristic 
polynomial. We label them Xi(A), . . . , X n (A) so that |Ai(yl)| ^ • • • ^ |A„(j4)| with growing 
phases. The spectral radius is |Ai(A)|. We also denote by si(A) ^ • • • ^ s n (A) the singular 
values of A, defined for all 1 ^ k ^ n by Sk(A) := Xj~(V AA*) where A* = A T is the 
conjugate-transpose. The matrix A maps the unit sphere to an ellipsoid, the half-lengths 
of its principal axes being the singular values of A. The operator norm of A is 

||-A||2->2 := max ||^4x||2 = si(A) while s n (A) = min ||^4x|| 2 . 
IMl2=i IMI 2 =1 

The matrix A is singular iff s n (A) = 0, and if not then s n (A) = s\(A^ 1 )^ 1 = ||^4 _1 ||^.2- 
If A is normal (i.e. A*A = A* A) then Si(A) = \Xi(A)\ for every 1 ^ i ^ n. Beyond normal 
matrices, the relationships between the eigenvalues and the singular values are captured 
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by the Weyl inequalities (see lemma lB.6p , Let us define the discrete probability measures 

^ n \ n 

^ A '-=-Y, 5 MA) and u A--=-J2 § s k (A)- 
k=l k=l 

From now on, we denote "— >" the weak convergence of probability measures with respect 
to bounded continuous functions. We use the abbreviations a.s., a. a., and a.e. for almost 
surely, Lebesgue almost all, and Lebesgue almost everywhere respectively. The notation 
n>l means large enough n. Let (-Xjj)i,j>i be an infinite table of i.i.d. complex random 
variables with finite positive variance < a 2 < oo. If one defines the square nxn complex 
random matrix X := (Xj j)i^ij^.n then the quartercircular law theorem (universal square 
version of the Marchenko-Pastur theorem, see |31|. 145 } 14"?]) states that a.s. 

v i x A Q a (1.1) 

Tjn n— >-oo 

where Q a is the quartercircular law on the real interval [0, 2a] with Lebesgue density 

x \-t — g \/4o- 2 - a? 2 l [0 2a] (x). 

Additionally, it is shown in [51 El E] that 

lim ai(-M0 = 2a a.s. iff E(X 11 ) = and E(\Xi i| 4 ) < oo. (1.2) 
Concerning the eigenvalues, the famous Girko circular law theorem states that a.s. 

fii x — > W CT (1.3) 

vAi n— >oo 

where lA a is the uniform law on the disc {z £ C : \z\ ^ a}, known as the circular law. This 
statement was established through a long sequence of partial results [32], EH [2H EHl EH1 
E3 IS [331 ESI SOI HI], the general case CLU} being finally obtained by Tao and Vu @TJ. 
From (|1.3p we have a.s. hm n _ s>00 |Afc(n _1 / 2 X)| ^ cr for any fixed k ^ 1, and we get from 
[71 [33] and (H2J that if additionally E(X Xtl ) = and E(|Xi j:l | 4 ) < oo then a.s. 

lunJX 1 ^X)\=a and ^ ^j§U 2. (1.4) 



The behavior of the ratio operator- norm/spectral-radius suggests that X is far from being 
an asymptotically normal matrix. Following [37] [16] , if E(.Xi i) 7^ while E(|Xii| ) < 00 
then a.s. |Ai(n _1 / 2 X)| — > +00 at speed y/n while \\2(n~ 1 / 2 X)\ remains bounded. 

The proof of (|1.3p is partly but crucially based on a polynomial lower bound on the 
smallest singular value proved in [3D]: for every a, d > 0, there exists b > such that for 
any deterministic complex nxn matrix A with s±(A) sj n d we have 

P(s n {X + A) < n~ b ) < n~ a . (1.5) 

In particular, by the first Borel-Cantelli lemma, there exists b > which may depend on 
d such that a.s. X + A is invertible with s n (X + A) ^ n~ b for n S> 1. 

1.1. Random Markov matrices and main results. From now on and unless otherwise 
stated (Xi t j)ij^i is an infinite array of nonnegative real random variables with mean 
m := E(Xi 1) > and finite positive variance a 2 := E(X 2 X ) — m 2 . Let us define the event 

£>n := {p n ,l ■ ■ ■ Pn,n > 0} where p n>i := X iA H h X i>n . 

Since a > we get g := P(Xl ; i = 0) < 1 and thus 

oo oo oo 

= E(! - c 1 - « n ) n ) < E n ? n < °°- 

n=l n=l n=l 
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By the first Borel-Cantelli lemma, a.s. for n>l, one can define the n x n matrix M by 

M ir .= ^. 
,J n ■ 

The matrix M is Markov since its entries belong to [0, 1] and each row sums up to 1. We 
have M = DX where X := (X t 

,j)i^i,j^n and D is the n x n diagonal matrix defined by 

Di,i := — . 

Pn,i 

We may define M and D for all n ^ 1 by setting, when p n> j = 0, Mjj = 5ij for all 1 ^ j ^ n 
and = 1. The matrix M has equally distributed dependent entries. However, the rows 
of M are i.i.d. and follow an exchangeable law on K™ supported by the simplex 

A n := {(pi,...,p n ) G [0,1]" :pi + •••+?„ = 1}. 

From now on, we set m = 1. This is actually no loss of generality since the law of the 
random matrix M is invariant under the linear scaling t — > tX{j for any t > 0. Since 
<7 < oo, the uniform law of large numbers of Bai and Yin [HJ lem. 2] states that a.s. 

max \p n i — n\ = o(n). (1-6) 

This suggests that \JnM is approximately equal to n~ l / 2 X for n S> 1. One can then 
expect that (jl.ip and (jl.3p hold for y/nM. Our work shows that this heuristics is valid. 
There is however a complexity gap between (jl.ip and fjl .3[) . due to the fact that for 
nonnormal operators such as M, the eigenvalues are less stable than the singular values 
under perturbations, see e.g. the book [33]. Our first result below constitutes the analog 
of the universal Marchenko-Pastur theorem (jl.ip for y/nM, and generalizes the result of 
the same kind obtained in [T7j in the case where X\ \ follows an exponential law. 

Theorem 1.1 (Quartercircular law theorem). We have a.s. 



Our second result provides some estimates on the largest singular values and eigenvalues. 
Theorem 1.2 (Extremes). We have X\(M) = 1. Moreover, i/E(|Xi i| ) < oo then a.s. 
lim si(M) = 1 and lim s 2 (VnM) = 2a while Tim \X 2 (VnM)\ < 2a. 

n— >oo n— >oo n— >co 

Our third result below is the analogue of (jl.3p for our random Markov matrices. When 
Xi t i follows the exponential distribution of unit mean, theorem 11.31 is exactly the circular 
law theorem for the Dirichlet Markov Ensemble conjectured in \17\ 1 15 j . Note that we 
provide, probably for the first time, an almost sure circular law theorem for a matrix 
model with dependent entries under a finite positive variance assumption. 

Theorem 1.3 (Circular law theorem). If Xn has a bounded density then a.s. 

7 / 

^JnM U cr- 
v n— >oo 

The proof of theorem 11.31 is crucially based on the following estimate on the norm of 
the resolvent of ^/nM . It is the analogue of (jl.5p for our random Markov matrices. 

Theorem 1.4 (Smallest singular value). If X\\ has a bounded density then for every 
a, C > there exists b > such that for any z £ G with \z\ ^ C, for n> 1, 

P(s n (v^M - zl) < n- b ) < n- a . 

In particular, for some b > which may depend on C, a.s. for n^> 1, the matrix ^/nM—zI 
is invertible with s n (y/nM — zl) ^ n~ b . 
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The proofs of theorems 11.1111. 2lfl~3lll, 41 are given in sections [2][3][1]{5] respectively. These 
proofs make heavy use of lemmas given in the appendices lAllBllCl 

The matrix M is the Markov kernel associated to the weighted oriented complete graph 
with n vertices with one loop per vertex, for which each edge i —> j has weight Xij. The 
skeleton of this kernel is an oriented Erdos-Renyi random graph where each edge exists 
independently of the others with probability 1 — q. If q = then M has a complete 
skeleton, is aperiodic, and 1 is the sole eigenvalue of unit module The nonoriented 
version of this graphical construction gives rise to random reversible kernels for which a 
semicircular theorem is available [12] . The bounded density assumption forces q = 0. 

Since M is Markov, we have that for every integer r ^ 0, 



1 n 1 n 

m (dz) = - VAI(M) = - Vp M (r,i) (1.7) 
n ^-^ n ^-^ 

i=i i=i 



where 



p M (r, i) ■= ^2 Mil ,ia • • • M i r -i,h 

l^ii,...,i r ^n 
il=i r =i 

is simply, conditional on M, the probability of a loop of length r rooted at i for a Markov 
chain with transition kernel M. This provides a probabilistic interpretation of the moments 
of the empirical spectral distribution \im of M. The random Markov matrix M is a random 
environment. By combining theorem 11.21 with theorem 11.31 and the identity (|1.7p , we get 
that for every fixed r ^ 0, a.s. 



lim ra2 I I y~]p M (r, i) ] =- 0. 

»— ■ n \ n L — ' n J 



I. 2. Discussion and open questions. The bounded density assumption in theorem 11.31 
is only due to the usage of theorem 1 1.41 in the proof. We believe that theorem ll.4l (and thus 

II. 3p is valid without this assumption, but this is outside the scope of the present work, see 
remark 15.11 and figure [TJ Our proof of theorem 11.31 is inspired from the Tao and Vu proof 
of (|1.3p based on Girko Hermitization, and allows actually to go beyond the circular law, 
see remarks 14. 1114.21 Concerning the extremes, by theorems II. 2111.31 a.s. 

cr ^ lim \\ 2 {VnM)\ ^ lim \\ 2 {y/nM)\ < 2a. 

n— >oo n— s>oo 

Also, a.s. the "spectral gap" of M is a.s. of order 1 — 0(n -1 / 2 ) (compare with the results 
of [25]). Note that in contrast with (|1.4p . we have from theorems 1 1 . 1 II 1 . 21 a.s. 

lim - 7—^ = 1. 

n^oo |Ai(M)| 

Numerical simulations suggest that if E(|Xi 5 i| 4 ) < 00 then a.s. 

lim \\2(\/nM)\ = a and thus lim 2 - ' | . = 2. 
n— Yoo n— >oo A2(-/W) 

Unfortunately, our proof of theorem ll.2l is too perturbative to extract this result. Following 
[TTl fig. 2], one can also ask if the phase Phase(A2(M)) = \2{M)/\ \ 2 {M)\ converges in 
distribution to the uniform law on [0, 2ir] as n — > 00. Another interesting problem concerns 
the behavior of max2^fc^ n yit(Xk(\/nM)) and the fluctuation as n — > 00 of the extremal 
singular values and eigenvalues of y^M, in particular the fluctuation of \2(y/nM). 

Classical results on the connectivity of Erdos-Renyi random graphs |2U[ [TU] imply that 
a.s. for n S> 1 the Markov matrix M is irreducible. Hence, a.s. for n 3> 1, the Markov 
matrix M admits a unique invariant probability measure k. If k is seen as a row vector in 
A n then we have kM = k. Let T := n~ 1 (5i + • • • + S n ) be the uniform law on {1, . . . , n}, 
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which can be viewed as the vector ra _1 (l, . . . , 1) G A n . By denoting ||-|| TV the total 
variation (or i 1 ) distance on A n , one can ask if a.s. 

lim ||k — T|| TV = 0. 

n~ >oo 

Recall that the rows of M are i.i.d. and follow an exchangeable law rj n on the simplex 
A n . By "exchangeable" we mean that if Z ~ r\ n then for every permutation % of {1, . . . , n} 
the random vector (Z^m, ■ ■ ■ , Z n i n )) follows also the law r/ n . This gives 

= Var(l) = Var(Zi + h Z n ) = nVar(Zi) + n(n - l)Cov(Zi, Z 2 ) 

and therefore Cov(Zi, Z 2 ) = — [n— l) _1 Var(Zi) ^ 0. One can ask if the results of theorems 
ll.im.2lfL3lll.4l remain essentially valid at least if M is a real n x n random matrix with 
i.i.d. rows such that for every 1 ^ i, j ^ f ^ n, 

E(M id ) = - and < Var(M iJ ) = 0(n~ 2 ) and \Cov{M id , M iif )\ = 0(n~ 3 ). 

These rates in n correspond to the Dirichlet Markov Ensemble for which X\ \ follows an 
exponential law and where r] n is the Dirichlet law T> n (l, . . . , 1) on the simplex A„. Another 
interesting problem is the spectral analysis of M when the law of Xi x has heavy tails, 
e.g. Xi t \ = V~P with 2/3 > 1 and where V is a uniform random variable on [0,1], see 
for instance [11 J for the reversible case. A logical first step consists in the derivation of 
a heavy tailed version of (|1.3p for X. This program is addressed in a separate paper 
|13| . In the same spirit, one may ask about the behavior of fJ-xoA where "o" denotes the 
Schur-Hadamard entrywise product and where A is some prescribed profile matrix. 



2. Proof of theorem 11.11 
Let us start by an elementary observation. The second moment ? n of ls given by 

e u^ M {dt) = I ^ = Tr (MM*) = £ + ^ 

_1 _i \ i,l ' ' ' ' ' ^i,n) 

By using (|1 .6j) together with the standard law of large numbers, we get that a.s. 

1 n 

qn ^ nHi + odW ^ X ^ = {1 + a2) + o(1) = 0(1) - {2A) 

i,j=l 

It follows by the Markov inequality that a.s. the sequence (i / v ^M))n>l is tight. However, we 
will not rely on tightness and the Prohorov theorem in order to establish the convergence 
of {v '^m)ti>1- We will use instead a perturbative argument based on the special structure 
of M and on (|1.6p . Namely, since V^M = nDn-^X, we get from ()B.4|) . for all 1 ^ i ^ n, 

s n (nD) Si (n- 1/2 X) < Si(y/nM) < Sl {nD) Si {n- 1/2 X). (2.2) 

Additionally, we get from ()L6p that a.s. 

lim max \nDa — 1| = and lim max |n -1 D~/ — 1| = 0. 

This gives that a.s. 

s\{nD) = max \nDi^\ = 1 + o(l) and s n (nD) = min \nDi i\ = 1 + o(l). (2.3) 

From ([22]), (123]), and CE3]), we get that a.s. for n > 1, 

s n ( v / nM) > and s n (n" 1/2 X) > (2.4) 
and from (|2.2|) and (|2.3p again we obtain that a.s. 

max log(s i ( v / ^M)) -log(s i (n- 1 / 2 X)) = o(l). (2.5) 



6 CHARLES BORDENAVE, PIETRO CAPUTO, AND DJALIL CHAFAI 

Now, from (jl.ip . and by denoting C a the image probability measure of Q a by log(-), a.s. 



1 n 



n ^ 3 log( S! (n-i/2X)) ^ £*■ 
i=l 

Next, using (|2.5p with lemma IC . 1 1 provides that a.s. 

1=1 

This implies by the change of variable t \— > e* that a.s. 



1 n f 



/ v / nAf . ^ , , , 

i=l 



which is the desired result. 



Remark 2.1 (Alternative arguments). From (|2.2p and (|2.3p . a.s. /or n»l, Sfc(M) = 
iff Sk(X) = 0, /or all 1 ^ k ^ n, and thus io^m({0}) = v n -i/2 X ({Q}), and hence the 
reasoning can avoid the usage of (|2.4|) . Note also that (|2.4|) is automatically satisfied 
when Xii is absolutely continuous since the set of singular matrices has zero Lebesgue 
measure. On the other hand, it is also worthwhile to mention that i/E(|Jfn| ) < oo then 
one can obtain the desired result without using (|2.4p . by using lemma YC.1\ together with 
(|3.2p . and this reasoning was already used by Aubrun for a slightly different model [3]. 



3. Proof of theorem 11.21 

Let us define the n x n deterministic matrix S := E(X) = (1, . . . , 1) (1, . . . , 1). The 
random matrix n~ l l 2 (X — S) has i.i.d. centered entries with finite positive variance a 2 
and finite fourth moment, and consequently, by (|1.2p . a.s. 

Sl {n- l l 2 (X - S)) = 2a + o{l). 

Now, since rank(n _1 / 2 S') = 1 we have, by lemma IB. 51 

s 2 {n- 1/2 X) ^ Sl (n' 1/2 (X - S)) 

and therefore, a.s. 

s 2 (n- 1/2 X) s£ 2<r + o(l). (3.1) 
By combining (|2.2p with (|2.3|) and (|3.1j) we get that a.s. 



max \ Si (y/nM) - Si(n" 1/2 X)| = o(l). (3.2) 

In particular, this gives from (|3.ip that a.s. 

s 2 {VnM) < 2(7 + o(l). (3.3) 
From theorem II. 1\ since Q a is supported by [0,2a], we get by using (|3.3p that a.s. 

lim s 2 (VnM) = 2a. (3.4) 

Next, since M is a Markov matrix, it is known that 

Ai(M) = 1. (3.5) 

Let us briefly recall the proof. Ifu:=(l,...,l) T then Mu = u and thus 1 is an eigenvalue 
of M. Next, let A € C be an eigenvalue of M and let x S C n be such that x 7^ and 
Mi = Ax. There exists 1 ^ i ^ n such that |xj| = max{|xi|, . . . , \x n \}. Since 7^ and 

n n 
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we get |A| ^ 1, which implies (|3.5p . Let us show now that a.s. 

lim si(M) = 1 (3.6) 

n— >oo 

Let S be as in the proof of theorem 11.11 From (jB.ip and fjl .2[) . we get a.s. 

si(v^M) si(raL>)si(n" 1/2 X) 

< s 1 (nD)s 1 (n- 1 / 2 (X - S) + n" 1 / 2 ^ 

< ^(nD)^!^- 1 / 2 ^ - 5)) + Sl (n-^ 2 S)) 

= (l + (l))(2a + (l) + n 1/2 ) 

which gives lim n _ >00 s\{M) ^ 1 a.s. On the other hand, from ()B.6j) and ()3.5p we get 
si(M) ^ |Ai(M)| = 1, which gives (|3.6p . It remains to establish that a.s. 

lim \\ 2 {yfnM)\ 2o\ 

n— >oo 

Indeed, from (|B.6P we get for every non null n x n complex matrix A, 

w r^i < S ^ A > 2 ^ 
|A2(A)I < • 

With A = -y/nM and by using ()3.4ti3.5tf3~6]) . we obtain that a.s. 

Sm \X 2 (VnM)\ < lim s 2 {\fnM) lim si(M) = 2a. 

n— >oo n— >oo n— >oo 

4. Proof of theorem 11.31 
Let us start by observing that from the Weyl inequality ()B.9p and (|2.ip . a.s. 



*l ^m(^) < / * ^m(^) = 0(1). 
c Jo 

This shows via the Markov inequality that a.s. the sequence (f^^M)n^i is tight. However, 
we will not rely directly on this tightness and the Prohorov theorem in order to establish 
the convergence of (/^y^jvf)n>i- We will use instead the Girko Hermitization of lemma IAJ2J 
We know, from the work of Dozier and Silverstein [18], that for all z G C, there exists a 
probability measure v z on [0, 00) such that a.s. {v n -i/2 X - z i)n^i converges weakly to v z . 
Moreover, following e.g. Pan and Zhou |33t lem. 3], one can check that for all z G C, 



UuAz) = ~ \og{t)u z {dt). 







To prove that a.s. (/^^Af tends weakly to U a , we start from the decomposition 

</nM-zI = nDW where W := n' 1 / 2 X - zn~ l D~ l . 
By using (|B.2p . (|1.6p . and lemma we obtain that for a. a. z G C, a.s. 



v w — > v z . 



Now, arguing as in the proof of theorem II .![ it follows that for all z G C, a.s. 

v JnM-zl ~~ r V Z- 
v n— >oo 

Suppose for the moment that for a. a. z G C, a.s. the function log(-) is uniformly integrable 
for {y^M-zi)n^\- Let 'P(C) by as in section lAl Lemma [A.2I implies that there exists 
jU G V(G) such that a.s. 

eg 

VJEM — > M and ^ = ^ a.e. 
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where Un is the logarithmic potential of /i as defined in section [A) Now by lemma lA.l} we 
obtain fx = U a , which is the desired result. It thus remains to show that for a. a. z G C, 
a.s. the function log(-) is uniformly integrable for {y^M-zl)n^\- For every z G C, a.s. by 
the Cauchy-Schwarz inequality, for all t ^ 1, and n>l, 

3 \ 2 poo 

l og( s )Vy^M-zi(ds)) v,/fiM-zi([t, oo)) J s 2 u^ M _ zI {ds). 

Now the Markov inequality and (|2.ip give that for all z G C, a.s. for all t ^ 1 

0(1) 



log(s) V^M-zl{ ds ) < t 2 

where the O(l) is uniform in i. Consequently, for all z G C, a.s. 



roo 

lim lim / log(s) v^ M -zi( ds ) = °- 



This means that for all z G C, a.s. the function lj l oo ) log(-) is uniformly integrable for 
iy^fiM-zl)n^l- It remains to show that for all z G C, a.s. the function l(o,i) log(-) is 
uniformly integrable for {y^M-ziln^i- This is equivalent to show that for all z G C, a.s. 

lim lim f - log(s) v^, M - z i{ds) = 0. 
For convenience, we fix z G C and set Sj := Si(y/nM — zl) for all 1 ^ i ^ n. Now we write 

,<5 1 L2"°"J 1 n-1 

-/ log(*)^M- 2 /(^) = - ^^(O^i-Sn-^logis-^) + - l (0i ,5) (s„_i) lo g(^-i) 

i=0 i=|_2n°-"J+l 

1 / _ld2« '"J 1 n-1 

< E 1 ^)^) + - £ l(0^)(^-i)log( S ^). 

i=0 i=L2n0-99j + i 

From theorem 11.41 (here we need the bounded density assumption) we get that a.s. 

InnE ^1^(^ = 

i=0 

and it thus remains to show that a.s. 

j n— 1 



<5->0 n-s>oo n 

i=[_2n°- 99 J+l 



This boils down to show that there exists c$ > such a.s. for n>l and 2n ^ i ^ n— 1, 

^ c -. (4.1) 

n 

To prove it, we adapt an argument due to Tao and Vu [H]. We fix 2ra ' 99 ^ i ^ n — 1 and 
we consider the matrix M' formed by the first n — \i/2] rows of 

Vn(VnM - 2I) = nZ)X - y^z/. 

By the Cauchy interlacing lemma IB.41 we get 

n -V2 J . < - . 

where s^- := Sj(M') for all 1 ^ j ^ n — [i/2] are the singular values of the rectangular 
matrix M' in nonincreasing order. Next, by the Tao and Vu negative moment lemma IB .31 

s'i -2 + • • • + s'~ 2 r . ,„-, = distr 2 + • • • + dist -2 r . Mn , 

1 n—\i/2\ 1 n—\i/2\> 
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where distj is the distance from the j th row of M' to Hj, the subspace spanned by the 
other rows of M' . In particular, we have 

n-\i/2\ 

^n-i<« E dist 7 2 ' ( 42 ) 

3=1 

Let Rj be the j th row of X. Since the j th row of M is DjjRj, we deduce that 

distj = dist(nDjjRj — z\fne^Hj) ^ nDjjdist(Rj,span(Hj ,ej)) 
where e%, . . . , e n is the canonical basis of IR™. Since span(i/j, ej) is independent of Rj and 



i 



lemma (|C.2p gives 



dim(span(fZj, ej)) ^ n ^ n 



n -i n-\i/2\ 



n 



0.99 



E P ( U U |dist(^,span(^ jei ))<^^ ) <oo 



n>l \i=2n°" j=l 

(note that the exponential bound in lemma IC.2I kills the polynomial factor due to the 
union bound over i,j). Consequently, by the first Borel-Cantelli lemma, we obtain that 
a.s. for n>l, all 2n - 99 ^ i < n — 1, and all 1 ^ j ^ n — \i/2] , 

o\Ti r- ° n 
distj ^ n Uj,j — 1= = Vi — - 



2V2 2y/2 pn,j' 

Now, the uniform law of large numbers (|1.6p gives that a.s. 

0. 



lim max 



1 

n 



We deduce that a.s. for n>l, all 2n°' 99 ^ i ^ n — 1, and all 1 ^ j ^ n — |~i/2] . 

disto ^ V^— 
J 4 

Finally, from (|4.2|) we get 

,•2 „2 



S 2 • > — 

n " 1 " n 2 32' 

and (03]) holds with c := a/(4v / 2). 

Remark 4.1 (Proof of the circular law (|1.3p and beyond). The same strategy allows a 
relatively short proof of (jl.3p . Indeed, the a.s. weak convergence of (jtt n -i/2 X ) n >i to ZY CT 
follows from the Girko Hermitization lemma \A~3& and the uniform integrability o/log(-) for 
(v n -i/2 X - z i)n^i as a&owe using (jl.5p . T/iis direct strategy does not rely on the replacement 
principle of Tao and Vu. The replacement principle allows a statement which is more 
general than (|1.3p involving two sequences of random matrices ( the main result of |41j on 
universality). Our strategy allows to go beyond the circular law (jl.3p . by letting E(Xj,-) 
possibly depend on i,j,n, provided that (j 1 . 5 1) and the result of Dozier and Silverstein [18J 
hold. Set A := (E(Xij))i^ij<^ n . If Tr (A A*) is large enough for n 3> 1, the limit is no 
longer the circular law, and can be interpreted by using free probability theory [38J . 

Remark 4.2 (Beyond the circular law). It is likely that the Tao and Vu replacement 
principle [H] allows a universal statement for our random Markov matrices of the form 
M = DX, beyond the circular law, by letting E(Xjj) possibly depend on i,j,n. This is 
however beyond the scope of the present work. 
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5. Proof of theorem 11.41 

Note that when z = 0, one can get some b immediately from (|B.3l II. 5| II. 6p . Thus, our 
problem is actually to deal with z / 0. Fix a, C > and z £ <C with \z\ ^ C. We have 

y/nM - zl - y/nDY where Y := X - n~ xl2 zD~ x . 
For an arbitrary 5 n > 0, let us define the event 

I Pn,i 



A:=n{| 



1=1 



1 

n 



By using the union bound and the Chebyshev inequality, we get P(«4£) a 2 5 n 2 . Now 
with c > a/2 and 5 n = n c we obtain P(A0 ^ rT a for n>l, Since we have 

s n {Dy x = max \p ni \, 

we get by (|B.3p . on the event A n , for n S> 1, 

{s n (v^M - 2l) < t n } C {^s„(D)s n (y) < i n } C {s„(y) < Vnt n (l + n c )} 

for every t n > 0. Now, for every b' > 0, one may select b > and set t n = n~ b such that 
y / nt n (l + n c ) ^ for Thus, on the event A n , for n> 1, 

-Mn := {s n (v^M - zJ) < n- 6 } C {s n (Y) < n" 6 '} =: 3> n . 
Consequently, for every 6' > there exists b > such that for n>l, 

P(M0 = P(M n n AO + P(M„ n A c n ) < PQg + P(^) s$ P(y ft ) + n~ a . 

The desired result follows if we show that for some b' > depending on a, C, for n S> 1, 

P(y n ) = P(s n (Y) ^ n- b ') ^ n- a . (5.1) 

Let us prove (|5.ip . At this point, it is very important to realize that (|5.ip cannot fol- 
low form a perturbative argument based on (jl.5IB.2ll.6p since the operator norm of the 
perturbation is much larger that the least singular value of the perturbed matrix. We 
thus need a more refined argument. We have Y = X — wD -1 with w := n~ l l 2 z. Let 
A w = A n -i/2 z be as in lemma IC. 31 For every 1 k ^ n, let Pk be the n x n permutation 
matrix for the transposition (1, k). Note that P\= I and for every 1 ^ k ^ n, the matrix 
PkA w Pk is n x n lower triangular. For every column vector e« of the canonical basis of R ra , 

(P k AP k )e t = h , 

I e fe — itf(ei H he n ) if z = fc. 

Now, if . . . , i? n and . . . , i?^ are the rows of the matrices X and y then 



Y 



'BSC 



. RL 



( R\P\A W P\ 



y -fn 



Define the vector space R'_ i := span{i?j : j / i} for every 1 ^ i ^ n. From lemma IB. 21 

min dist(i?i, ^ y/ns n (y). 

l^i^n 

Consequently, by the union bound, for any u ^ 0, 

P(y/ns n (T) < «) < n max P(dist(i?-, USA < it). 

l^i^n 

The law of dist(it^, R'_j) does not depend on i. We take i = 1. Let V' be a unit normal 
vector to R'_ 1 . Such a vector is not unique, but we just pick one, and this defines a random 
variable on the unit sphere S n_1 := {x G C n : ||x|| 2 = 1}- Since V G and ||^'|| 2 = 1, 

\R[ • y'| < dist( J R / 1 ,i?'_ 1 ). 
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Let v be the distribution of V' on S" . Since V' and R\ are independent, for any u ^ 0, 

P(dist( J R / 1 ,i?'_ 1 ) < u) < P(|i?i -V'\ <«) = / F(\R\ ■ v'\ u)dv(v'). 

Let us fix v' G $ n_1 . If Pi = /, R\ are as above then 

R\ ■ v = R% • v where v := PiA w Piv' = A w v' . 
Now, since v' G S n_1 , lemma IC. 31 provides a constant > such that for ij>1, 
\\v\\ 2 = \\A w v'\\ ^ min HA^xL = s n (A w ) ^ K~ x . 

But ||u || 2 ^ i^ -1 implies ^ K^/n for some j G {1, . . . , n}, and therefore 

|<Ke(^-)| -1 ^ K\/2n~ or |Jm(^-)| _1 ^ ^V^. 

Suppose for instance that we have |9 c le(uj)| _1 ^ K\^2n. We first observe that 

P(\R[ • v'\ ^u) = P(|i?i ■ u| < u) < P(|!He(i?i • v)\ ^ u). 

The real random variable 9\e(Ri -v) is a sum of independent real random variables and one 
of them is X\j9\z{vj), which is absolutely continuous with a density bounded above by 
BK\^2n where B is the bound on the density of X\ \. Consequently, by a basic property of 
convolutions of probability measures, the real random variable D\e(R\ -v) is also absolutely 
continuous with a density tp bounded above by BK\[2~n, and therefore, 

P(|*He(i?i ■ v)\ < u) = / ip(s) ds ^ BKV2n2u. 

J [— u,u] 

To summarize, for n>l and every u ^ 0, 

P(v^s n (y) ^ u) BK(2nf /2 u. 

Lemma IC. 31 shows that the constant K may be chosen depending on C and not on z, and 
(|5.1[) holds with b' = d+1/2 by taking u = n~ d such that BK(2nfl 2 n~ d ^ for n>l. 

Remark 5.1 (Assumptions). Our proof of theorem \l-4\ still works if the entries of X 
are just independent and not necessarily i.i.d. provided that the densities are uniformly 
bounded and that (|1.6p holds. The bounded density assumption allows to bound the small 
ball probability P(|i?i • v\ ^ u) uniformly over v. If this assumption does not hold, then 
the small ball probability may depend on the additive structure of v, but the final result is 
probably still valid. A possible route, technical and uncertain, is to adapt the Tao and Vu 
proof of (|1.5p . On the opposite side, if Xn has a log-concave density (e.g. exponential) 
then a finer bound might follow from a noncentered version of the results of Adamczak et 
al [T]. Alternatively, if X±% has sub-Gaussian or sub- exponential moments then one may 
also try to adapt the proof of Rudelson and Vershynin |34j to the noncentered settings. 

Remark 5.2 (Away from the limiting support). The derivation of an a.s. lower bound 
on s n {y/nM — zl) is an easy task when \z\ > 2a and ~E(\Xi t i\ 4: ) < oo, without assuming 
that Xi^i has a bounded density. Let us show for instance that for every z G C, a.s. 

s n ( v / nM - zl) ^ \z\ -2a + o{\). (5.2) 

This lower bound is meaningful only when \z\ > 2a. For proving (|5.2p . we adopt a pertur- 
bative approach. Let us fix z G C By (|B,3p and (|1.6p we get that a.s. 

s n (y/nM - zl) ^ n _1 (! + s n (VnX - zD' 1 ). (5.3) 

Now we write, with S = EX = (1, . . . , 1)(1, . . . , 1) T , 

^JnX-zD- 1 = yJnS-znl+W where W := ^/n~{X - S) + nzl - zD' 1 . 
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We observe that (|B,2|) gives 

s n {^X - zD- 1 ) > s n (^S - znl) - s x {W). 
For the symmetric complex matrix ^/nS — znl we have for any z £ C and n>l, 

8 n (y/nS — znl) = nmin(|z|, \y/n — z\) = n\z\. 
On the other hand, since 1E(|Xi i| ) < oo, by ()1.2p and (jl.6p . a.s. for every z £ C, 

si(W) < si(Vn(X - S)) + si(nzJ - zD~ l ) = n{2a + o(l)) + |*|no(l). 
Putting all together, we have shown that a.s. for any z£C, 

a n (y/nX - zD' 1 ) > n|z|(l - o(l)) - n(2a + o(l)). 
Combined with (|5 .3j) . i/iis gives finally (|5.2[) . 

Remark 5.3 (Invertibility) . £e£ (^4 n )n^i fre a sequence of complex random matrices where 
A n is n x n /or every n 1, defined on a common probability space (£l,A, P). -For every 
uj £ £1, the set U n ^i{\i(A n (u))), . . . , \ n (A n (u))} is at most countable and has thus zero 
Lebesgue measure. Therefore, for all oj £ f2 and a. a. z £ C, k /iave s n (^4 n (a;) — zl) > 
/or all all n 1. Aoie i/tai ()1.5p and theorem \l-4\ imply respectively that for A n = X or 
A n = M , this holds for all z £ C, a.s. on and n>l. 

Appendix A. Logarithmic potential and Hermitization 

Let 'P(C) be the set of probability measures on C which integrate log |-| in a neighbor- 
hood of infinity. For every [i £ V(G), the logarithmic potential of u on C is the function 
f7 M : C — >• (— oo, +oo] defined for every z £ C by 



U fl (z) = - log\z-z'\v(dz') = -(log\-\*fi)(z). (A.l) 
For instance, for the circular law IA\ of density ^ 1 ^{ z e€-.\z\^i}j we have, for every z £ C, 

rr / \ / — log if \A > 1, 

Wl(z)= l^(i-N 2 ) if MO, 

see e.g. [35]. Let P'(C) be the set of Schwartz-Sobolev distributions (generalized functions). 
Since log |-| is Lebesgue locally integrable on C, one can check by using the Fubini theorem 
that XJn is Lebesgue locally integrable on C. In particular, < oo a.e. and Up £ 2?' ((D). 
Since log |-| is the fundamental solution of the Laplace equation in C, we have, in 2?' ((D), 

AU^ = -2iru. (A.2) 

This means that for every smooth and compactly supported "test function" ip : C — > R, 

A(p(z)U^(z) dz = —2ir / <p(z) u(dz). 



Lemma A.l (Unicity). For every v £ 'P(C), ifUn = XJ V a.e. then n = v. 

Proof. Since = U v in £>'(C), we get AU M = AU V in £>'(C). Now (jA~2]) gives /i = v in 
2?'(<D), and thus u = v as measures since /i and i/ are Radon measures. □ 

If A is an n x n complex matrix and Pa{z) '■= det(A— zl) is its characteristic polynomial, 
Up A {z) = - f log \z' - z\ n A (dz') = -- log \det(A - zl)\ = - -log \P A (z)\ 

for every z £ C \ {Ai(^4), . . . , A n (^4)}. We have also the alternative expression 

i r°° 

U^ A {z) = — logdet(^(A- zI)(A-zI)*) = - / log(t) v A -zi{dt). (A.3) 
n Jo 
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The identity (|A.3j) bridges the eigenvalues with the singular values, and is at the heart of 
the following lemma, which allows to deduce the convergence of \ia from the one of va-zI- 
The strength of this Hermitization lies in the fact that in contrary to the eigenvalues, one 
can control the singular values with the entries of the matrix. The price payed here is the 
introduction of the auxiliary variable z and the uniform integrability. We recall that on 
a Borel measurable space (E,£), we say that a Borel function / : E — > It is uniformly 
integrable for a sequence of probability measures (r? n ) n ^i on E when 



We will use this property as follows: if {j] n ) n ^\ converges weakly to r\ and / is continuous 
and uniformly integrable for (r/ n )n>i then / is ^-integrable and lin^^oo J f dr] n = j f rj. 
The idea of using Hermitization goes back at least to Girko [22] . However, the proofs of 
lemmas IA.2I and IA.3I below are inspired from the approach of Tao and Vu [41] . 

Lemma A. 2 (Girko Hermitization). Let (A n ) n ^i be a sequence of complex random ma- 
trices where A n is n x n for every n ^ 1, defined on a common probability space. Suppose 
that for a. a. z € C ; there exists a probability measure v z on [0,oo) such that a.s. 

(i) ( u A„-zi)n^i converges weakly to v z as n — > oo 

(ii) log(-) is uniformly integrable for {yA n -zi) n >\ 

Then there exists a probability measure fi £ V(G) such that 

(j) a.s. {^A n )n^\ converges weakly to fi as n — > oo 
(jj) for a.a. z 6 C, 



Jo 

Moreover, if (-A n ) n >i is deterministic, then the statements hold without the "a.s." 

Proof. Let z and u be such that (i-ii) hold. For every 1 ^ k ^ n, define 

a n ,k ■= \^k(A n - zl)\ and b H:k := s k (A n - zl) 

and set v := v z . Note that fiA„-zi = l±A n * Thanks to the Weyl inequalities ([B.6P 
and to the assumptions (i-ii), one can use lemma lA. 3 1 below, which gives that {pA n )n^i is 
tight, that log \z — -\ is uniformly integrable for (fJ-A n )n^i, and that 



Consequently, a.s. \x G V(G) and = U a.e. for every adherence value /i of (fJ-A n )n^l- 
Now, since U does not depend on fj,, by lemma lAT] a.s. (/M n ) n >i has a unique adherence 
value (j,, and since (/i n )n>i is tight, (fJ>A„)n^i converges weakly to \i by the Prohorov 
theorem. Finally, by (|A.2j) . fi is deterministic since U is deterministic, and (j-jj) hold. □ 

The following lemma is in a way the skeleton of the Girko Hermitization of lemma IA.21 
It states essentially a propagation of a uniform logarithmic integrability for a couple of 
triangular arrays, provided that a logarithmic majorization holds between the arrays. 

Lemma A. 3 (Logarithmic majorization and uniform integrability). Let (on,fc)i^fc!gn and 
(b n ,k)i^k^n be two triangular arrays in [0, oo). Define the discrete probability measures 




(A.4) 








and 




k=l 



k=l 



If the following properties hold 

(i) a n ,i ^ • • • > a n ,n and b n> i ^ ^ b n<n for n > 1, 

(ii) nLi a n,i < n!Li b n ,i for every l^k^nforn^>l 
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(iii) niLfc b n,i < n™=fe a n,i for every 1 < k < n for n > 1, 

(iv) (f n ) n ^i converges weakly to some probability measure v as n — >• oo, 

(v) log(-) is uniformly integr able for {u n )n^i, 
then 

(j) (Mn)n>l w %H 

(jj) log(-) is uniformly integrable for (/i n ) n ^i, 

(jjj) we frave, as n —>■ oo, 



\og{t) Hn{dt) = \og{t)v n {dt) -> / log(t)i/(dt), 
and in particular, for every adherence value fi of (/i ra )n^i> 

3 (>oo 

log(t)/z(dt) = / log(i)i/(di). 

J 

Proof. Proof of (jjj). From the logarithmic majorizations (ii-iii) we get, for n>l, 

n n 

J\ a n,k = JJ bn,k, 
fc=l fe=l 

and (v) gives 6 ni fc > and a ni fc > for every 1 ^ k ^ n and n>l. Now, (iv-v) give 

n 

.,k 



/ log(rj) Hn{dt) = - log TT a n . A 
1 " 

= - log TT 

fc=l 

oo 

log(t)i/„(dt) -> / \og{t)u{dt). 







Proof of (j). From (ii) and (v) we get 

^ /"OO 

sup V"log(o n ,i) < sup V^log(Vi) and C := sup / | log(s)| ^„(ds) < oo 
respectively. Now the tightness of (/i n )n^i follows from 

/oo 
log(s) i/„(cfe) < C. (A.5) 

Proof of (jj). We start with the uniform integrability in the neighborhood of infinity. 
Let us show that for n>l, for any e > there exists t ^ 1 such that 



\og(s)n n (ds) < e. (A.6) 

Jt 

If z/((l,oo)) = then (iv) implies 

/oo 
log(t)i/ n (dt) < e 

for n>l and (|A.6p follows then from ()A.5p , If otherwise z/((l,oo)) > then 

/oo 
log(i) u(dt) > 

and one can assume that e < c. Let us show that there exists a sequence of integers 
(&n)n>l such that lmin^oo k n /n — > a > and for n»l, 

sup - V log(6„,i) < e. (A.7) 
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For < e/2 < c, let t be the innmum over all s > 1 such that 

3 1 
log(it) v(du) < -£. 

There exists s ^ i such that v({s}) = 0, and from (v) we get 

/"OO /'OO 2 

lim i/ n ((s, oo)) = z/((s, oo)) ^ and lim / log(u) v n (du) = / log(-u) u(du) ^ —e. 

n— >oo n— >oo J s J g 2 

If f((s, oo)) > then ()A.7p holds with <r := ^((s, oo)) and k n := L^fn((-s, oo))J . Otherwise, 
v((s,oo)) = 0, and if k' n := [n^ n ((s, oo))J then lhm^oo k' n /n = 0, while for any 5 > 0, 

- ^ log(6„,i) < ! + <Hog(«). 

i=l 

Taking /c n := A4 + |_?t<5J with 5 small enough, we deduce that (|A.7|) holds. We have thus 
shown that ()A.7p holds in all case. Now, from (ii) and (|A.7p we get for every 1 ^ ^ k n , 



1 

-Vl lo g( a n,i) < e. 
In particular, by using (i), we get log(a nj fc n ) ^ sn/k n and 

log(u)n n (du) < e. 



i/kn 



Since lim n ^, c ^, k n /n = 5 > 0, we deduce that (|A.6j) holds with t := e eS . Now, by following 
the same reasoning, with (ii) replaced by (hi), we obtain that for all e > 0, there exists 
< t < 1 such that for n»l, 



/ log(s)fin(ds) < e, 
Jo 



which is the counterpart of (|A.6j) needed for (jj). □ 

Remark A. 4 (Other fundamental aspects of the logarithmic potential). The logarithmic 
potential is related to the Cauchy-Stieltjes transform of fi via 



Sn(z) := f — fi(dz') = (d x — id y )U fJ- (z) and thus (d x + id y )S )J 

J c z — z 



-2"7T/i 



mP'(C). The term "logarithmic potential" comes from the fact that Un is the electrostatic 
potential of \i viewed as a distribution of charges in C = R 2 [35J . The logarithmic energy 



£(jjl) := / Un(z) fi(dz) = - / log \z - z'\ n(dz)ix(dz') 
Jc Jc Je 

is up to a sign the Voiculescu free entropy of h in free probability theory |44] . The circular 



lawlA a minimizes fi i-> £ (//) under a second moment constraint [35]. In the spirit of (|A.3D 
and beyond matrices, the Brown [2] spectral measure of a nonnormal bounded operator 
a is \i a := {-An)- l /\$™\og{t)v a „ zI {dt) where v a -zi is the spectral distribution of the 
self-adjoint operator (a — zl)(a — zl)* . Due to the logarithm, the Brown spectral measure 
Ha depends dis continuously on the ^-moments of a [38] . For random matrices, this 
problem is circumvented in the Girko Hermitization by requiring a uniform integrability, 
which turns out to be a.s. satisfied for random matrices such as n~ 1 / 2 X or y/nM . 
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Appendix B. General spectral estimates 

We gather in this section useful lemmas on deterministic matrices. We provide mainly 
references for the most classical results, and sometimes proofs for the less classical ones. 

Lemma B.l (Basic inequalities [28J). If A and B are n x n complex matrices then 

si (AB) < si(A)si(5) and si(A + B) < 8l (A) + si(S) (B.l) 

and 

max \si(A) — Si(B)\ < si(A - B) (B.2) 

l^i^n 

and 

s n (AB) ^ s n (A)s n (B). (B.3) 

Moreover, if A = D is diagonal, then for every 1 ^ i ^ n 

s n (D) Si (B) ^ Sl {DB) < s\(D)si(B). (B.4) 

Lemma B.2 (Rudelson-Vershynin row bound). Let A be a complex n x n matrix with 
rows R\, . . . , R n . Define the vector space R-i := span{i?j : j ^ i}. We have then 

nT 1 / 2 min dist(i?i, R-i) ^ s n (A) ^ min dist(i?j, R-i). 

l^i^n l^iSJn 

The argument behind lemma lB~2l is buried in [33] • We give a proof below for convenience. 

Proof of lemma [B.2l Since A, A T have same singular values, one can consider the columns 
Ci, . . . , C n of A instead of the rows. For every column vector ieC™ and 1 ^ i ^ n, the 
triangle inequality and the identity Ax = x\C\ + • • • + x n C n give 

|| Ax\\2 ^ dist(Ax, C-i) = min ||Ax — y|| 2 = min [|xjCj — y\\ 2 = |xj|dist(Cj, C_j). 

If ||x|| 2 = 1 then necessarily |a?j| ^ n -1 / 2 for some 1 ^ i ^ n and therefore 
s n (A) = min ||Ax|| 2 n -1 / 2 min dist(Cj, C_j). 

||x|| 2 =l l^i^n 

Conversely, for every 1 ^ i ^ n, there exists a vector y with y.i = 1 such that 

dist(C7j,C_j) = ||yiCi H h y n C n || 2 = ||Ay|| 2 ^ ||y|| 2 min ||Ac|| 2 ^ s„(A) 

IMI 2 =i 

where we used the fact that ||y|| 2 = |yi| 2 + • • • + |y n | 2 ^ \yi\ 2 = 1. □ 

Recall that the singular values s\(A), . . . , s n '(A) of a rectangular n' x n complex matrix 
A with n' ^ n are defined by Si (A) := \ t {yfAA*) for every 1 ^ i ^ n . 

Lemma B.3 (Tao-Vu negative second moment |41[ lem. A4]). If A is a full rank n' x n 
complex matrix (n' ^ n) with rows R\, . . . , R n > , and R-i := span{i?j : j ^ i}, then 

n' n' 

Y,Si(A)- 2 = Y,dist(Ri,R-i)- 2 . 

1=1 1=1 

Lemma B.4 (Cauchy interlacing by rows deletion [25]). Let A be an nxn complex matrix. 
If B is n' x n, obtained from A by deleting n — n' rows, then for every 1 i ^ n' , 

Si{A) ^ 8i(B) ^ s i+n - n '(A). 

Lemma fB.41 gives [s n /(B), si(B)] C [s n (A), s\(A)], i.e. row deletions produce a compres- 
sion of the singular values interval. Another way to express this phenomenon consists in 
saying that if we add a row to B then the largest singular value increases while the smallest 
is diminished. Closely related, the following result on finite rank additive perturbations. 
If A is an n x n complex matrix, let us set Si(A) := +oo if i < 1 and Si(A) := if i > n. 
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Lemma B.5 (Thompson-Lidskii interlacing for finite rank perturbations |42j ) . For any 
n x n complex matrices A and B with rank(^4 — B) ^ k, we have, for any i € {1, . . . , n}, 

Si _ k {A) > Si {B) > s i+k (A). (B.5) 

Even if lemma lB.51 gives nothing on the extremal singular values Si(B) where i ^ k or 
n — i < k, it provides however the useful "bulk" inequality \\Fa — Fb\\oo ^ rank(^4 — B)/n 
where Fa and Fb are the cumulative distribution functions of va and v& respectively. 

Lemma B.6 (Weyl inequalities [46 J). For every n x n complex matrix A, we have 

k k n n 

H\\i(A)\ ^Y[ Si (A) and J] Si (A) < JJ \Xi(A)\ (B.6) 

i=l i=l i=k i=k 

for all 1 ^ k ^ n, with equality for k = n. In particular, by viewing |det(A)| as a volume, 

n n n 

| det(A)\ = J] \X k (A)\ = J] s k (A) = J] dist(R k ,span{Ri, R k -i}) (B.7) 
fc=i fc=i fc=i 

where R\,..., R n are the rows of A. Moreover, for every increasing function if from (0, oo) 
to (0,oo) such that t h-» ^(e*) is convex on (0,oo) and tp(0) := lim t __ !> o+ = 0; we have 

k k 

J>(|A^)| 2 K5>(^) 2 ) (B.8) 
i=i i=i 

for every 1 ^ k ^ n. In particular, with ip{t) = t for every t > and k = n, we obtain 

n n n 

^|A fc (^)| 2 <^, fc ^) 2 = Tr(^)= £ \A id \\ (B.9) 

k=l k=l i,j=l 

It is worthwhile to mention that (|B.5|) and (|B.6[) are optimal in the sense that every 
sequences of numbers satisfying these inequalities are associated to matrices, see [4"2j [27], 

Appendix C. Additional lemmas 

Lemma IC . 1 1 b elow is used in the proof of theorem ll.il We omit its proof since it follows 
for instance quite easily from the Paul Levy criterion on characteristic functions. 

Lemma C.l (Convergence under uniform perturbation). Let (a n ,fc)i^fc^n a- n d (^n,fc)i^fc^n 
be triangular arrays of complex numbers. Let ^ be a probability measure on C. 



1 n 1 n 

If ~y^Sa nk fj, and lim max \a nk - b n k\ = then ~y^6 bnk fx. 

k=l k=l 

Lemma IC. 21 below is used for the rows of random matrices in the proof of theorem 11.31 

Lemma C.2 (Tao-Vu distance lemma [41} prop. 5.1]). Let (Xiji^i be i.i.d. random 
variables on C with finite positive variance a 2 := E(|Xi — EXi| 2 ). For n^> 1 and every 
deterministic subspace H of C n with 1 ^ dim(H) ^ n — n a ", setting R := (X\, . . . , X n ), 



Pjdist^F) ^ - V™-dim(#)j ^ exp(-n a01 ). 

The proof of lemma IC.2I is based on a concentration inequality for convex Lipschitz 
functions and product measures due to Talagrand see also [301 cor - 4.9]. The power 
0.01 is used here to fix ideas and is obviously not optimal. This is more than enough for 
our purposes (proof of theorem II. 3|) . A careful reading of the proof of theorem 11.31 shows 
that a polynomial bound on the probability with a large enough power on n suffices. 

We end up this section by a lemma used in the proof of theorem 11.41 
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Lemma C.3 (A special matrix). For every w £ C, let us define the n x n complex matrix 

(\ ••• N 



A„ = I — w 



V 



Then for every z E C we have S2(A^) = • • • = s n -i(A^) = 1 for n^> 1 while 
lim s n (A^) = lim si(A^) = 



'2 + M 2 + |z|i/4 + |z| 2 
and the convergence is uniform on every compact subset o/C 

Proof. Note that Ao = / and A W A W > = A ww i_^ w+w i^ for every w,w' G C. Moreover, is 
invertible if and only if it; ^ 1 and in that case (A w )~ l = A w /r w _i\. It is a special case of 
the Sherman-Morrison formula for the inverse of rank one perturbations. It is immediate 
to check that s±(A w — I) = \\A W — /H2-4.2 = v^M f° r every w £ C An elementary explicit 
computation reveals that the symmetric matrix Ay^A^ — I has rank at most 2, and thus 
A w has at least n — 2 singular values equal to 1 and in particular s n (A w ) ^ 1 ^ s%(A w ). 
From now, let us fix z G C and set w = n~~ l / 2 z and A = A w for convenience. The matrix 
A is nonsingular for ti>l since w — > as n — > 00. Also, we have s n (A) > for n 3> 1. 
Since A is lower triangular with eigenvalues 1 — u>,l,...,l, by (|B.7|) . 

n n 

\l-w\=Y[\Xi(A)\ = \det(A)\ = Y[ Si (A) = u_u + (C.l) 
i=i i=i 
where U- ^ ti+ are two singular values of A. We have also 

u 2 _+u\ + (n-2) = Sl {A) 2 + ■■■ + s n {A) 2 = Tr(AA*) = |1 - w\ 2 + (n - 1)(1 + |w| 2 ) 

which gives u 2 _ + tt 2 ^ = 1 + |1 — w\ 2 + (n — l)|u;| 2 . Combined with (jC.ip . we get that u 2 ^ 
are the solution of X 2 — (1 + (n — l)|tf | 2 + |1 — u> | 2 )A~ + |1 — w| 2 = 0. This gives 



2i4 = 2 + \z\ 2 + 0(n" 1/2 ) ± |z|^/4 + \z\ 2 + 0{n- 1 / 2 ) 

and the 0(n -1//2 ) is uniform in z on every compact. From this formula we get that u_ ^ 1 
and u+ ^ 1 for n S> 1, and thus tt_ = s n (A) and u + = s\{A). □ 

The result of lemma IC.3I is more than enough for our purposes. More precisely, a 
careful reading of the proof of theorem 11.41 shows that a polynomial (in n) lower bound 
on s n (A n -i/2 z ) for n>l, uniformly on compact sets on z, is actually enough. 
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Figure 1. Here we have fixed n = 250, and X\ : \ follows the Bernoulli 
law \{8q + Si). In both graphics, the solid circle has radius rrT l a = 1. 
The left hand side graphic is the superposition of the plot of A2, . . . , A n for 
10 i.i.d. simulations of y/riM, made with the GNU Octave free software. 
The right hand side graphic is the Vorono'i tessellation of A2 , . . . , A n for a 
single simulation of y/nM. Since ^/nM has real entries, its spectrum is 
symmetric with respect to the real axis. On the left hand side graphic, 
it seems that the spectrum is slightly more concentrated on the real axis. 
This phenomenon, which disappears as n — > 00, was already described for 
random matrices with i.i.d. real Gaussian entries by Edelman [19) . see also 
the work of Akemann and Kanzieper [2]. Our simulations suggest that 
theorem 11.31 remains valid beyond the bounded density assumption. 
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